Abstract. Given an irreducible subshift of finite type $X$, a subshift
$Y$, a factor map $\pi : X \to Y$, and an ergodic invariant measure
$\nu$ on $Y$, there can exist more than one ergodic measure on $X$ which
projects to $\nu$ and has maximal entropy among all measures in the
fiber, but there is an explicit bound on the number of such maximal
entropy preimages.



1. Introduction 

It is a well-known result of Shannon and Parry that every
irreducible subshift of finite type (SFT) $X$ on a finite alphabet has a
unique measure $\mu_X$ of maximal entropy for the shift transformation
$\sigma$. The maximal measure is Markov, and its initial distribution and
transition probabilities are given explicitly in terms of the maximum
eigenvalue and corresponding eigenvectors of the 0,1 transition matrix
for the subshift. We are interested in any possible relative version of
this result: given an irreducible SFT $X$, a subshift $Y$, a factor map
$\pi : X \to Y$, and an ergodic invariant measure $\nu$ on $Y$, how many
ergodic invariant measures can there be on $X$ that project under $\pi$ to
$\nu$ and have maximal entropy in the fiber $\pi^{-1}\{\nu\}$? We will show that
there can be more than one such ergodic relatively maximal measure
over a given $\nu$, but there are only finitely many. In fact, if $\pi$ is a
1-block map, there can be no more than the cardinality of the alphabet
of $X$ (see Corollary 1, below). Call a measure $\nu$ on $Y$ $\pi$-determinate
in case it has a unique preimage of maximal entropy. We provide some
sufficient conditions for $\pi$-determinacy and give examples of situations
in which relatively maximal measures can be constructed explicitly.
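
The explicit form of the Shannon-Parry measure mentioned above can be illustrated numerically. The following is a minimal sketch, not taken from the paper: the helper names `perron`, `parry_measure` and `markov_entropy` are hypothetical, and the eigenvectors are approximated by power iteration on $A + I$ (an implementation choice made here so that the iteration also converges for periodic irreducible matrices). It uses the standard formulas $P_{ij} = A_{ij} r_j / (\lambda r_i)$ and $p_i = l_i r_i / \sum_k l_k r_k$, where $l$ and $r$ are left and right eigenvectors for the maximum eigenvalue $\lambda$.

```python
import math

def perron(apply, n, iters=500):
    """Power iteration: returns (growth factor, positive vector with sum 1)."""
    v = [1.0 / n] * n
    growth = 1.0
    for _ in range(iters):
        w = apply(v)
        growth = sum(w)            # valid because sum(v) == 1
        v = [x / growth for x in w]
    return growth, v

def parry_measure(A):
    """Maximal-entropy Markov measure for an irreducible 0,1 matrix A.

    Returns (lam, P, p): maximum eigenvalue, transition matrix, stationary vector.
    """
    n = len(A)
    # Iterate with A + I (same eigenvectors, eigenvalue shifted by 1).
    right = lambda v: [v[i] + sum(A[i][j] * v[j] for j in range(n)) for i in range(n)]
    left = lambda v: [v[j] + sum(v[i] * A[i][j] for i in range(n)) for j in range(n)]
    growth, r = perron(right, n)
    _, l = perron(left, n)
    lam = growth - 1.0
    P = [[A[i][j] * r[j] / (lam * r[i]) for j in range(n)] for i in range(n)]
    z = sum(l[i] * r[i] for i in range(n))
    p = [l[i] * r[i] / z for i in range(n)]
    return lam, P, p

def markov_entropy(P, p):
    """Entropy of the stationary Markov measure with matrix P and vector p."""
    return -sum(p[i] * P[i][j] * math.log(P[i][j])
                for i in range(len(P)) for j in range(len(P)) if P[i][j] > 0)

# Golden mean shift: the block "11" is forbidden.
lam, P, p = parry_measure([[1, 1], [1, 0]])
print(lam, markov_entropy(P, p), math.log(lam))
```

For the golden mean shift the computed entropy should agree with $\log\lambda$, the topological entropy, up to rounding.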

Throughout the paper, unless stated otherwise $X$ will denote an
irreducible SFT, $Y$ a subshift on a finite alphabet, and $\pi : X \to Y$ a
factor map (continuous, onto, shift-commuting map). By recoding if
necessary, we may assume that $X$ is a 1-step SFT, so that it consists of
all (2-sided) sequences on a finite alphabet consistent with the allowed
transitions described by a directed graph with vertex set equal to the
alphabet, and that $\pi$ is a 1-block map. In the following, "measure"
means "Borel probability measure", $C(X)$ denotes the set of continuous
real-valued functions on $X$, $\mathcal{M}(X)$ the space of $\sigma$-invariant measures
on $X$, and $\mathcal{E}(X) \subset \mathcal{M}(X)$ the set of ergodic measures on $X$.

Some of the interest of this problem arises from its connections
with information-compressing channels [11], non-Markov
functions of Markov chains, measures of maximal Hausdorff
dimension and measures that maximize, for a given $\alpha > 0$, the weighted
entropy functional

(1)  $\phi_\alpha(\mu) = \frac{1}{\alpha+1}\,[h(\mu) + \alpha h(\pi\mu)]$

[19], and relative pressure and relative equilibrium states.
The theory of pressure and equilibrium states, relative
pressure and relative equilibrium states, and compensation
functions provides basic tools in this area. For a factor map
$\pi : X \to Y$ between compact topological dynamical systems and
potential function $V \in C(X)$, Ledrappier and Walters defined the
relative pressure $P(\pi, V) : Y \to \mathbb{R}$ (a Borel measurable function) and
proved a relative variational principle: For each $\nu \in \mathcal{M}(Y)$,

(2)  $\int_Y P(\pi,V)\,d\nu = \sup\{h_\mu(X|Y) + \int_X V\,d\mu : \mu \in \pi^{-1}\nu\}$.

Any measure $\mu$ that attains the supremum is called a relative
equilibrium state. A consequence is that the ergodic measures $\mu$ that have
maximal entropy among all measures in $\pi^{-1}\{\nu\}$ have relative entropy
given by

(3)  $h_\mu(X|Y) = \int_Y \lim_{n\to\infty} \frac{1}{n} \log|\pi^{-1}[y_0 \dots y_{n-1}]|\,d\nu(y)$.

($|\pi^{-1}[y_0 \dots y_{n-1}]|$ is the number of $n$-blocks in $X$ that map under $\pi$
to the $n$-block $y_0 \dots y_{n-1}$.) By the Subadditive Ergodic Theorem, the
limit inside the integral exists a.e. with respect to each ergodic measure
$\nu$ on $Y$, and it is constant a.e. The quantity

(4)  $P(\pi,0)(y) = \limsup_{n\to\infty} \frac{1}{n} \log|\pi^{-1}[y_0 \dots y_{n-1}]|$

is the relative pressure of the function 0 over $y \in Y$. The maximum
possible relative entropy may be thought of as a "relative topological
entropy over $\nu$"; we denote it by $h_{\mathrm{top}}(X|\nu)$.
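
The block counts $|\pi^{-1}[y_0 \dots y_{n-1}]|$ appearing in (3) and (4) can be computed by a simple dynamic program when $X$ is a 1-step SFT and $\pi$ is a 1-block map. The sketch below is illustrative only: the function name `count_preimage_blocks` and the three-symbol example graph (symbols $a, b_1, b_2$ with $b_2 \to b_1$ forbidden) are assumptions chosen for the illustration, not data from the paper. It assumes every symbol of $X$ has a predecessor and a follower, so that allowed paths coincide with blocks appearing in $X$.

```python
def count_preimage_blocks(A, pi, y_block):
    """Number of allowed X-blocks x_0...x_{n-1} with pi[x_i] == y_block[i].

    A  : 0,1 transition matrix of the 1-step SFT X (A[a][b] == 1 iff ab is allowed)
    pi : list mapping each X-symbol index to a Y-symbol (the 1-block map)
    """
    n = len(A)
    # counts[b] = number of admissible partial preimages ending in symbol b
    counts = [1 if pi[b] == y_block[0] else 0 for b in range(n)]
    for y in y_block[1:]:
        counts = [
            sum(counts[a] * A[a][b] for a in range(n)) if pi[b] == y else 0
            for b in range(n)
        ]
    return sum(counts)

# Hypothetical example: X-symbols 0 = a, 1 = b1, 2 = b2; Y-symbols 0 = a, 1 = b.
EXAMPLE_A = [[1, 1, 1],
             [1, 1, 1],
             [1, 0, 1]]   # b2 -> b1 is forbidden
EXAMPLE_PI = [0, 1, 1]

# The Y-block a b b b a: the subscript on b can switch from 1 to 2 at four places.
print(count_preimage_blocks(EXAMPLE_A, EXAMPLE_PI, [0, 1, 1, 1, 0]))
```

Dividing the logarithm of such a count by $n$ gives a finite-$n$ approximation to the integrand in (3) along a given point $y$.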

To understand when a Markov measure on $Y$ has a Markov measure
on $X$ in its preimage under $\pi$, Boyle and Tuncel introduced the idea
of a compensation function, and the concept was developed further
by Walters. Given a factor map $\pi : X \to Y$ between topological
dynamical systems, a compensation function is a continuous function
$F : X \to \mathbb{R}$ such that

(5)  $P_Y(V) = P_X(V\circ\pi + F)$ for all $V \in C(Y)$.

The idea is that, because $\pi : \mathcal{M}(X) \to \mathcal{M}(Y)$ is many-to-one, we
always have

(6)  $P_Y(V) = \sup\{h_\nu(\sigma) + \int_Y V\,d\nu : \nu \in \mathcal{M}(Y)\}$

(7)  $\le \sup\{h_\mu(\sigma) + \int_X V\circ\pi\,d\mu : \mu \in \mathcal{M}(X)\}$,

and a compensation function $F$ can take into account, for all potential
functions $V$ on $Y$ at once, the extra freedom, information, or free energy
that is available in $X$ as compared to $Y$ because of the ability to move
around in fibers over points of $Y$. A compensation function of the form
$G\circ\pi$ with $G \in C(Y)$ is said to be saturated.

The machinery of relative equilibrium states and compensation
functions is used to establish the following basic result about relatively
maximal measures:

Suppose that $\nu \in \mathcal{E}(Y)$ and $\pi\mu = \nu$. Then $\mu$ is relatively maximal
over $\nu$ if and only if there is $V \in C(Y)$ such that $\mu$ is an equilibrium
state of $V\circ\pi$.

Notice that if there is a locally constant saturated compensation
function $G\circ\pi$, then every Markov measure on $Y$ is $\pi$-determinate with
Markov relatively maximal lift, because it has been shown that if there
is a saturated compensation function $G\circ\pi$, then the relatively maximal
measures over an equilibrium state of $V \in C(Y)$ are the equilibrium
states of $V\circ\pi + G\circ\pi$.

Further, $\mu_X$ is the unique equilibrium state of the potential function
0 on $X$, the unique maximizing measure for $\phi_0$; and the relatively
maximal measures over $\mu_Y$ are the equilibrium states of $G\circ\pi$, which
can be thought of as the maximizing measures for $\phi_\infty$.
2. Bounding the number of ergodic relatively maximal measures



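Let $\pi : X \to Y$ be a 1-block factor map from a 1-step SFT $X$ to
a subshift $Y$ and let $\nu$ be an ergodic invariant measure on $Y$. Let
$\mu_1, \dots, \mu_n \in \mathcal{M}(X)$ with $\pi\mu_i = \nu$ for all $i$. Recall the definition of the
relatively independent joining $\hat\mu = \mu_1 \otimes_\nu \cdots \otimes_\nu \mu_n$ of $\mu_1, \dots, \mu_n$ over $\nu$:
if $A_1, \dots, A_n$ are measurable subsets of $X$ and $\mathcal{B}_Y$ is the $\sigma$-algebra of $Y$,
then

(8)  $\hat\mu(A_1 \times \cdots \times A_n) = \int_Y \prod_{i=1}^n E_{\mu_i}(1_{A_i} \mid \pi^{-1}\mathcal{B}_Y)\circ\pi^{-1}\,d\nu$.

Writing $p_i$ for the projection $X^n \to X$ onto the $i$'th coordinate, we
note that for $\hat\mu$-almost every $\hat x$ in $X^n$, $\pi(p_i(\hat x))$ is independent of $i$;
write $\phi(\hat x)$ for this common value.

We define a number of $\sigma$-algebras on $X^n$. Denoting by $\mathcal{B}_X$ the
$\sigma$-algebra of $X$ and by $\mathcal{B}_Y$ the $\sigma$-algebra of $Y$, let $\mathcal{B}_0 = \phi^{-1}\mathcal{B}_Y$,
$\mathcal{B}_i = p_i^{-1}\mathcal{B}_X$ for $i = 1, \dots, n$, $\mathcal{B}_X^-$ the $\sigma$-algebra generated by $x_n$, $n < 0$, and
$\mathcal{B}_i^- = p_i^{-1}\mathcal{B}_X^-$ for each $i$. Note: later we will use the same symbols for
corresponding sub-$\sigma$-algebras of a different space, $Z = X \times X \times R$.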

Definition. We say that two measures $\mu_1, \mu_2 \in \mathcal{E}(X)$ with $\pi\mu_1 =
\pi\mu_2 = \nu$ are relatively orthogonal (over $\nu$) and write $\mu_1 \perp_\nu \mu_2$ if

(9)  $(\mu_1 \otimes_\nu \mu_2)\{(u,v) \in X \times X : u_0 = v_0\} = 0$.

Theorem 1. For each ergodic $\nu$ on $Y$, any two distinct ergodic
measures on $X$ of maximal entropy in the fiber $\pi^{-1}\{\nu\}$ are relatively
orthogonal.

Since $\pi$ is a 1-block factor map, for each symbol $b$ in the alphabet of
$Y$, $\pi^{-1}[b]$ consists of a union of 1-block cylinder sets in $X$. Let $N_\nu(\pi)$
denote the minimum number of cylinders in the union as $b$ runs over
the symbols in the alphabet of $Y$ for which $\nu[b] > 0$.

Corollary 1. Let $X$ be a 1-step SFT, $Y$ a subshift on a finite alphabet,
and $\pi : X \to Y$ a 1-block factor map. For any ergodic $\nu$ on $Y$, the
number of ergodic invariant measures of maximal entropy in the fiber
$\pi^{-1}\{\nu\}$ is at most $N_\nu(\pi)$.

Proof. Suppose that we have $n > N_\nu(\pi)$ ergodic measures $\mu_1, \dots, \mu_n$
on $X$, each projecting to $\nu$ and each of maximal entropy in the fiber
$\pi^{-1}\{\nu\}$. Form the relatively independent joining $\hat\mu$ on $X^n$ of the
measures $\mu_i$ as above. Let $b$ be a symbol in the alphabet of $Y$ with
$\nu[b] > 0$ such that $b$ has $N_\nu(\pi)$ preimages $a_1, \dots, a_{N_\nu(\pi)}$ under the
block map $\pi$. Since $n > N_\nu(\pi)$, for every $\hat x \in X^n$ whose coordinates all
project into $[b]$ there are $i \ne j$ with $(p_i\hat x)_0 = (p_j\hat x)_0$.

At least one of the sets $S_{i,j} = \{\hat x \in X^n : (p_i\hat x)_0 = (p_j\hat x)_0\}$ must have
positive $\hat\mu$-measure, and then also $(\mu_i \otimes_\nu \mu_j)\{(u,v) \in X \times X : \pi u =
\pi v, u_0 = v_0\} > 0$, contradicting Theorem 1. □
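
The bound $N_\nu(\pi)$ of Corollary 1 depends only on the 1-block map and on which symbols of $Y$ carry positive $\nu$-measure. As a small illustration (the function name `n_nu` and the dictionary-based encoding of $\pi$ and of the 1-block masses of $\nu$ are assumptions made for this sketch), it can be computed as follows.

```python
from collections import Counter

def n_nu(pi, nu_one_block):
    """Minimum number of X-symbols over a Y-symbol b with nu[b] > 0.

    pi           : dict mapping each X-symbol to its image Y-symbol
    nu_one_block : dict mapping each Y-symbol b to the mass nu[b]
    """
    preimage_sizes = Counter(pi.values())
    return min(preimage_sizes[b] for b, mass in nu_one_block.items() if mass > 0)

# Hypothetical 1-block map: a has one preimage, b has two.
pi = {"a": "a", "b1": "b", "b2": "b"}
print(n_nu(pi, {"a": 0.4, "b": 0.6}))   # at most this many ergodic maximal lifts
print(n_nu(pi, {"a": 0.0, "b": 1.0}))   # a is not charged, so only b counts
```

When some symbol of positive measure has a single preimage, the bound is 1, which is the situation of the singleton clump considered next.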

Corollary 2. Suppose that $\pi : X \to Y$ has a singleton clump: there
is a symbol $a$ of $Y$ whose inverse image is a singleton, which we also
denote by $a$. Then every ergodic measure on $Y$ which assigns positive
measure to $[a]$ is $\pi$-determinate.

Before giving the proof of Theorem 1, we recall some facts about
conditional independence of $\sigma$-algebras (see [10, p. 17]) and prove a
key lemma.

Lemma 1. Let $(X, \mathcal{B}, \mu)$ be a probability space. For sub-$\sigma$-algebras
$\mathcal{B}_0, \mathcal{B}_1, \mathcal{B}_2$ of $\mathcal{B}$, the following are equivalent:

(1) $\mathcal{B}_1 \perp_{\mathcal{B}_0} \mathcal{B}_2$, which is defined by the condition that for every
$\mathcal{B}_1$-measurable $f_1$ and $\mathcal{B}_2$-measurable $f_2$, $E(f_1 f_2|\mathcal{B}_0) = E(f_1|\mathcal{B}_0)E(f_2|\mathcal{B}_0)$;

(2) for every $\mathcal{B}_2$-measurable $f_2$, $E(f_2|\mathcal{B}_1 \vee \mathcal{B}_0) = E(f_2|\mathcal{B}_0)$;

(3) for every $\mathcal{B}_1$-measurable $f_1$, $E(f_1|\mathcal{B}_2 \vee \mathcal{B}_0) = E(f_1|\mathcal{B}_0)$.

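Lemma 2. Let $(X, \mathcal{B}, \mu)$ be a probability space and let $\mathcal{B}_0, \mathcal{B}_1, \mathcal{B}_2, \mathcal{C}_1, \mathcal{C}_2$ be
sub-$\sigma$-algebras of $\mathcal{B}$. If $\mathcal{B}_1 \perp_{\mathcal{B}_0} \mathcal{B}_2$, $\mathcal{C}_1 \subset \mathcal{B}_1$, $\mathcal{C}_2 \subset \mathcal{B}_2$, then for every
$\mathcal{B}_1$-measurable $f_1$,

(10)  $E(f_1|\mathcal{B}_0 \vee \mathcal{C}_1 \vee \mathcal{C}_2) = E(f_1|\mathcal{B}_0 \vee \mathcal{C}_1)$.

Proof. First note that $\mathcal{B}_1 \perp_{\mathcal{B}_0 \vee \mathcal{C}_2} \mathcal{B}_2$, since for $\mathcal{B}_1$-measurable $f_1$ we
have $E(f_1|(\mathcal{B}_0 \vee \mathcal{C}_2) \vee \mathcal{B}_2) = E(f_1|\mathcal{B}_0 \vee \mathcal{B}_2) = E(f_1|\mathcal{B}_0) = E(f_1|\mathcal{B}_0 \vee \mathcal{C}_2)$.
Similarly, $\mathcal{B}_1 \perp_{\mathcal{B}_0 \vee \mathcal{C}_1} \mathcal{B}_2$ and $\mathcal{B}_1 \perp_{\mathcal{B}_0 \vee \mathcal{C}_1} \mathcal{C}_2$. Thus for any $f_1$ that is
$\mathcal{B}_1$-measurable, $E(f_1|(\mathcal{B}_0 \vee \mathcal{C}_1) \vee \mathcal{C}_2) = E(f_1|\mathcal{B}_0 \vee \mathcal{C}_1)$. □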

Lemma 3. Let $\pi : X \to Y$ be a 1-block factor map from a 1-step SFT
$X$ to a subshift $Y$. Let $\nu$ be an ergodic measure on $Y$ and let $\mu_1$ and
$\mu_2$ be ergodic members of $\pi^{-1}\{\nu\}$. Let $\hat\mu$ be their relatively independent
joining. If $S = \{(u,v) \in X \times X : u_{-1} = v_{-1}\}$ has positive measure
with respect to $\hat\mu$ and for every symbol $j$ in the alphabet of $X$

(11)  $E_{\hat\mu}(1_{[j]}\circ p_1|\mathcal{B}_1^- \vee \mathcal{B}_0) = E_{\hat\mu}(1_{[j]}\circ p_2|\mathcal{B}_2^- \vee \mathcal{B}_0)$ a.e. on $S$,

then $\mu_1 = \mu_2$.
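Proof. Write $[i]_k$ for the set of points in $X$ whose $k$'th symbol is $i$ and
$[i]_k^{(j)}$ for $p_j^{-1}[i]_k$. Write $1_{[i]_k^{(j)}}$ for the indicator function of this set. Define
$g_i^{(j)} = E(1_{[i]_0^{(j)}}|\mathcal{B}_0 \vee \mathcal{B}_j^-)$ and set $s_k = \sum_i 1_{[i]_k^{(1)}} 1_{[i]_k^{(2)}} = 1_{\{(u,v) : u_k = v_k\}}$. Note
that $s_{-1} = 1_S$.

Let $\mathcal{P}$ denote the time-0 partition of $X$ into 1-block cylinder sets,
$\mathcal{P}_i = p_i^{-1}\mathcal{P}$ ($i = 1, 2$) the corresponding partitions of $X \times X$, and
$T = \sigma \times \sigma$.

By assumption, we have $s_{-1} g_i^{(1)} = s_{-1} g_i^{(2)}$ for all symbols $i$ in the
alphabet of $X$. Taking expectations with respect to $\mathcal{B}_1 \vee T\mathcal{P}_2$, since
$s_{-1} g_i^{(1)}$ is $\mathcal{B}_1 \vee T\mathcal{P}_2$-measurable, we see that

(12)  $s_{-1} g_i^{(1)} = E(s_{-1} g_i^{(2)}|\mathcal{B}_1 \vee T\mathcal{P}_2) = s_{-1} E(g_i^{(2)}|\mathcal{B}_1 \vee T\mathcal{P}_2) = s_{-1} E(g_i^{(2)}|\mathcal{B}_0 \vee T\mathcal{P}_2)$,

where the last equality follows from Lemma 2, noting that $\mathcal{B}_0 \subset \mathcal{B}_1$.
Observe that the terms in the final expression are all measurable with
respect to $\mathcal{B}_0 \vee T\mathcal{P}_1 \vee T\mathcal{P}_2$.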

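It then follows that

(13)  $s_{-1} g_i^{(1)} = E(s_{-1} g_i^{(1)}|\mathcal{B}_0 \vee T\mathcal{P}_1 \vee T\mathcal{P}_2) = s_{-1} E(g_i^{(1)}|\mathcal{B}_0 \vee T\mathcal{P}_1 \vee T\mathcal{P}_2)$.

Since $g_i^{(1)}$ is $\mathcal{B}_1$-measurable and $\mathcal{B}_1$ and $\mathcal{B}_2$ are relatively independent
over $\mathcal{B}_0$, by Lemma 2 the right side is equal to $s_{-1} E(g_i^{(1)}|\mathcal{B}_0 \vee T\mathcal{P}_1)$. We
have thus established the equation

(14)  $s_{-1} E(g_i^{(1)}|\mathcal{B}_0 \vee T\mathcal{P}_1) = s_{-1} g_i^{(1)} = s_{-1} g_i^{(2)} = s_{-1} E(g_i^{(2)}|\mathcal{B}_0 \vee T\mathcal{P}_2)$.

Starting from the equation $s_{-1} g_i^{(1)} = s_{-1} E(g_i^{(2)}|\mathcal{B}_0 \vee T\mathcal{P}_2)$, we take
conditional expectations with respect to $\mathcal{B}_1$ to get

(15)  $E(s_{-1}|\mathcal{B}_1)\, g_i^{(1)} = E(s_{-1} E(g_i^{(2)}|\mathcal{B}_0 \vee T\mathcal{P}_2)|\mathcal{B}_1)$.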
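We have

(16)  $E(g_i^{(2)}|\mathcal{B}_0 \vee T\mathcal{P}_2) = \sum_k 1_{[k]_{-1}^{(2)}} \frac{E(g_i^{(2)} 1_{[k]_{-1}^{(2)}}|\mathcal{B}_0)}{E(1_{[k]_{-1}^{(2)}}|\mathcal{B}_0)}$.
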

MEASURES OF MAXIMAL RELATIVE ENTROPY 






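Hence

(17)  $s_{-1} E(g_i^{(2)}|\mathcal{B}_0 \vee T\mathcal{P}_2) = \sum_k 1_{[k]_{-1}^{(1)}} 1_{[k]_{-1}^{(2)}} \frac{E(g_i^{(2)} 1_{[k]_{-1}^{(2)}}|\mathcal{B}_0)}{E(1_{[k]_{-1}^{(2)}}|\mathcal{B}_0)}$.

Substituting this in (15) and again using relative independence, we
see that

(18)  $E(s_{-1}|\mathcal{B}_1)\, g_i^{(1)} = \sum_k 1_{[k]_{-1}^{(1)}} E(g_i^{(2)} 1_{[k]_{-1}^{(2)}}|\mathcal{B}_0)$.

We observe that the right-hand side and also $E(s_{-1}|\mathcal{B}_1)$ are $\mathcal{B}_0 \vee T\mathcal{P}_1$-
measurable (using the definition of $s_{-1}$ and relative independence).
Hence provided that $E(s_{-1}|\mathcal{B}_1) > 0$ a.e., we will have that $g_i^{(1)}$ is $\mathcal{B}_0 \vee
T\mathcal{P}_1$-measurable, and similarly $g_i^{(2)}$ is $\mathcal{B}_0 \vee T\mathcal{P}_2$-measurable.

We now demonstrate that $E(s_{-1}|\mathcal{B}_1) > 0$ on a set of full measure.
To prove this, we note that $E(s_{-1}|\mathcal{B}_1)$ is of the form $f\circ p_1$ for $f$ a
function on $X$. Thus if we can show that $E(s_{-1}|\mathcal{B}_1)(x) > 0$ implies
$E(s_{-1}|\mathcal{B}_1)(Tx) > 0$, it will follow that the set where $f$ is positive is
invariant and hence of measure 0 or 1 by ergodicity of $\mu_1$. Since the
integral of the function is positive (being equal to $(\mu_1 \otimes_\nu \mu_2)\{(u,v) :
u_{-1} = v_{-1}\}$), to show that the function is positive on a set of full
measure it is enough to establish the above invariance.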

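Now

(19)  $E(s_{-1}|\mathcal{B}_1)(Tx) = E(s_0|\mathcal{B}_1)(x)$
      $= \sum_i 1_{[i]_0^{(1)}} E(1_{[i]_0^{(2)}}|\mathcal{B}_1)$
      $\ge \sum_i 1_{[i]_0^{(1)}} E(E(s_{-1} 1_{[i]_0^{(2)}}|\mathcal{B}_1 \vee T\mathcal{P}_2)|\mathcal{B}_1)$.

Using Lemma 2 this equals

(20)  $\sum_i 1_{[i]_0^{(1)}} E(s_{-1} E(1_{[i]_0^{(2)}}|\mathcal{B}_1 \vee T\mathcal{P}_2)|\mathcal{B}_1)$
      $= \sum_i 1_{[i]_0^{(1)}} E(s_{-1} E(1_{[i]_0^{(2)}}|\mathcal{B}_0 \vee T\mathcal{P}_2)|\mathcal{B}_1)$
      $= \sum_i 1_{[i]_0^{(1)}} E(s_{-1} E(g_i^{(2)}|\mathcal{B}_0 \vee T\mathcal{P}_2)|\mathcal{B}_1)$
      $= \sum_i 1_{[i]_0^{(1)}} E(s_{-1} g_i^{(1)}|\mathcal{B}_1)$ (from (14))
      $= \sum_i 1_{[i]_0^{(1)}} g_i^{(1)} E(s_{-1}|\mathcal{B}_1)$
      $= E(s_{-1}|\mathcal{B}_1) \sum_i 1_{[i]_0^{(1)}} E(1_{[i]_0^{(1)}}|\mathcal{B}_0 \vee \mathcal{B}_1^-)$.

For $x$ in a set of full measure, $1_D(x) > 0$ implies $E(1_D|\mathcal{F})(x) > 0$ (consider
integrating the conditional expectation over the set where it takes
the value 0), so the sum on the right-hand side of the above is positive
almost everywhere. Since the first factor is positive by assumption, the
conclusion that $E(s_0|\mathcal{B}_1) > 0$ follows, allowing us to deduce that $g_i^{(j)}$ is
$\mathcal{B}_0 \vee T\mathcal{P}_j$-measurable.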

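Now we may write $g_i^{(j)}$ as

(21)  $g_i^{(j)} = \sum_k 1_{[k]_{-1}^{(j)}} h_{k,i}^{(j)}$,

where the $h_{k,i}^{(j)}$ are $\mathcal{B}_0$-measurable. Writing out the equation $s_{-1} g_i^{(1)} =
s_{-1} g_i^{(2)}$, we have

(22)  $\sum_k 1_{[k]_{-1}^{(1)}} 1_{[k]_{-1}^{(2)}} h_{k,i}^{(1)} = \sum_k 1_{[k]_{-1}^{(1)}} 1_{[k]_{-1}^{(2)}} h_{k,i}^{(2)}$.

Since for distinct $k$, the terms are disjointly supported, we have for
each $k$,

(23)  $1_{[k]_{-1}^{(1)}} 1_{[k]_{-1}^{(2)}} h_{k,i}^{(1)} = 1_{[k]_{-1}^{(1)}} 1_{[k]_{-1}^{(2)}} h_{k,i}^{(2)}$.

Taking conditional expectations of both sides with respect to $\mathcal{B}_0$ and
using Lemma 1, we deduce

(24)  $E(1_{[k]_{-1}^{(1)}}|\mathcal{B}_0)\, E(1_{[k]_{-1}^{(2)}}|\mathcal{B}_0)\, (h_{k,i}^{(1)} - h_{k,i}^{(2)}) = 0$ a.e.

From this we see that if $E(1_{[k]_{-1}^{(1)}}|\mathcal{B}_0) > 0$ and $E(1_{[k]_{-1}^{(2)}}|\mathcal{B}_0) > 0$, then
$h_{k,i}^{(1)} = h_{k,i}^{(2)}$. This allows us to make the following definition:

(25)  $h_{k,i} = h_{k,i}^{(1)}$ if $E(1_{[k]_{-1}^{(1)}}|\mathcal{B}_0) > 0$;
      $h_{k,i} = h_{k,i}^{(2)}$ if $E(1_{[k]_{-1}^{(2)}}|\mathcal{B}_0) > 0$.

It follows that

(26)  $g_i^{(j)} = \sum_k h_{k,i} 1_{[k]_{-1}^{(j)}}$  $\hat\mu$-a.e.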

We now show that the two measures agree. We will show by
induction on the length of the cylinder set that for any $\mathcal{B}_0$-measurable
function $f$ and any cylinder set $C$ in $X$,

(27)  $\int 1_S\, (1_C\circ p_1)\, f\,d\hat\mu = \int 1_S\, (1_C\circ p_2)\, f\,d\hat\mu$.

To start the induction, let $C$ be the cylinder set $[i_0]$ in $X$. Then

(28)  $\int 1_S 1_{[i_0]_0^{(j)}} f\,d\hat\mu = \int 1_S f E(1_{[i_0]_0^{(j)}}|\mathcal{B}_1^- \vee \mathcal{B}_2^- \vee \mathcal{B}_0)\,d\hat\mu = \int 1_S f g_{i_0}^{(j)}\,d\hat\mu$;

but by assumption $1_S g_{i_0}^{(1)} = 1_S g_{i_0}^{(2)}$, showing the result in the case that
$C$ is a cylinder of length 1. Now suppose that the result holds for
cylinders of length $n$ and let $C = [i_0 \dots i_n]$. Write $D = [i_0 \dots i_{n-1}]$.
Now

(29)  $\int 1_S (1_C\circ p_j) f\,d\hat\mu = \int 1_S (1_D\circ p_j) 1_{[i_n]_n^{(j)}} f\,d\hat\mu$
      $= \int 1_S (1_D\circ p_j) f E(1_{[i_n]_n^{(j)}}|T^{-n}\mathcal{B}_1^- \vee T^{-n}\mathcal{B}_2^- \vee \mathcal{B}_0)\,d\hat\mu$
      $= \int 1_S (1_D\circ p_j) f\, g_{i_n}^{(j)}\circ T^n\,d\hat\mu$
      $= \int 1_S (1_D\circ p_j) f\, h_{i_{n-1},i_n}\circ T^n\,d\hat\mu$.

Since $h_{i_{n-1},i_n}\circ T^n$ is $\mathcal{B}_0$-measurable, it follows from the induction
hypothesis that the integrals are equal for $j = 1$ and $j = 2$ as required.

In particular, taking $f$ to be 1, we have $\hat\mu(S \cap p_1^{-1}C) = \hat\mu(S \cap p_2^{-1}C)$
for all $C$. Letting $\theta(A) = \hat\mu(S \cap A)$, we see that $\theta\circ p_1^{-1} = \theta\circ p_2^{-1}$. Since
$\mu_i(A) \ge \theta\circ p_i^{-1}(A)$ for all $A$ and the measures $\mu_i$ are ergodic, it follows
that $\mu_1$ and $\mu_2$ are not mutually singular and hence are equal. □



Proof of Theorem 1. Let $\mu_1$ and $\mu_2$ be two different ergodic relatively
maximal measures over $\nu \in \mathcal{E}(Y)$ and suppose that they are not
relatively orthogonal, so that $(\mu_1 \otimes_\nu \mu_2)\{(u,v) \in X \times X : u_0 = v_0\} > 0$. Let
$\hat\mu = \mu_1 \otimes_\nu \mu_2$. We will construct a measure on $X$ with strictly greater
entropy than $\mu_1$ or $\mu_2$ by building a larger space from which the new
measure will appear as a factor. (J. Steif reminded us that a similar
interleaving of two processes has been used elsewhere for a different purpose.)

Let $R$ denote the set $\{1, 2\}^{\mathbb{Z}}$, and let $\beta$ be the Bernoulli measure on $R$
with probabilities 1/2, 1/2. Write $(r_n)_{n\in\mathbb{Z}}$ for a typical element of $R$. Form
$Z = X^2 \times R$ with invariant measure $\eta = \hat\mu \times \beta$. We then define maps
from $Z$ to $X$ as follows. Given a point $(u, v, r) \in Z$, set $\pi_1(u, v, r) = u$,
$\pi_2(u, v, r) = v$ and write $N_k(u, v)$ for $\sup\{n < k : u_n = v_n\}$. Note that
this quantity may be $-\infty$ if there are no coincidences. We will take
$r_{-\infty}$ to be a further random variable taking the values 1 and 2 with
equal probability for each $r \in R$. Define $\pi_3 : Z \to X$ by

(30)  $\pi_3(u, v, r)_k = u_k$ if $r_{N_k(u,v)} = 1$;
      $\pi_3(u, v, r)_k = v_k$ if $r_{N_k(u,v)} = 2$.

To see that $\pi_3(u, v, r)$ is indeed a point of $X$, note that it consists of
concatenations of parts of $u$ and $v$, changing only at places where they
agree. As a corollary, since $\pi(u) = \pi(v)$ for almost all $(u, v, r) \in Z$, it
follows that $\pi(\pi_3(z)) = \pi(\pi_2(z)) = \pi(\pi_1(z))$ for $\eta$-almost every $z$ in $Z$.
Write $\Phi$ for the factor mapping $\pi\circ\pi_1$ from $(Z, \eta)$ to $(Y, \nu)$.
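
The interleaving map $\pi_3$ of (30) can be illustrated on finite windows. The following sketch is only an illustration under simplifying assumptions: the function name `interleave` is hypothetical, sequences are truncated to indices $0, \dots, L-1$, and coincidences before the start of the window are ignored, so that the value `r_minus_inf` plays the role of $r_{-\infty}$ until the first coincidence inside the window.

```python
def interleave(u, v, r, r_minus_inf):
    """Finite-window version of pi_3: follow u or v, switching only after coincidences.

    u, v        : equal-length lists of symbols (two points over the same y)
    r           : list of choices in {1, 2}, one per coordinate
    r_minus_inf : choice used while no earlier coincidence has been seen
    """
    out = []
    last_coincidence = None            # N_k: latest n < k with u[n] == v[n]
    for k in range(len(u)):
        choice = r_minus_inf if last_coincidence is None else r[last_coincidence]
        out.append(u[k] if choice == 1 else v[k])
        if u[k] == v[k]:
            last_coincidence = k       # affects coordinates strictly after k
    return out

u = [0, 1, 2, 1, 3]
v = [4, 1, 5, 1, 6]
r = [1, 2, 1, 1, 2]
print(interleave(u, v, r, r_minus_inf=1))
```

Because a switch between $u$ and $v$ can only take effect immediately after a coordinate where the two sequences agree, every transition in the output is a transition of $u$ or of $v$, which is why the result stays in the 1-step SFT.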

By construction $\mu_1 = \eta\circ\pi_1^{-1}$ and $\mu_2 = \eta\circ\pi_2^{-1}$. Define $\mu_3 = \eta\circ\pi_3^{-1}$.
We shall then demonstrate that $h_{\mu_3}(X) > h_{\mu_1}(X) = h_{\mu_2}(X)$.

We define $\sigma$-algebras on $Z$ corresponding to those appearing above.
Letting $\mathcal{B}_X$ be the Borel $\sigma$-algebra on $X$ as before, we set for each
$i = 1, 2, 3$, $\mathcal{B}_i = \pi_i^{-1}\mathcal{B}_X$. Write $\mathcal{B}_X^-$ for the $\sigma$-algebra generated by the
cylinder sets in $X$ depending on coordinates $x_n$ for $n < 0$. These then
give $\sigma$-algebras $\mathcal{B}_i^-$ on $Z$ defined by $\mathcal{B}_i^- = \pi_i^{-1}\mathcal{B}_X^-$. We will require two
further $\sigma$-algebras, $\mathcal{B}_0 = \Phi^{-1}\mathcal{B}_Y$ with $\mathcal{B}_0^-$ being defined analogously to
the above. Note that $\mathcal{B}_i \supset \mathcal{B}_0$ for $i = 1, 2, 3$.

Again reusing previous notation in a slightly different context,
continue to denote by $\mathcal{P}$ the partition of $X$ into time-0 cylinders and write
$\mathcal{P}_i$ for $\pi_i^{-1}\mathcal{P}$, so that for $i = 1, 2, 3$, $\mathcal{P}_i$ is a partition of $Z$. Finally, write
$\mathcal{Q} = \{\Phi^{-1}[j] : [j]$ is a cylinder set in $Y\}$.

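It is useful to note the following property of (8): If $A_1 \in \mathcal{B}_1$ and
$A_2 \in \mathcal{B}_2$, then

(31)  $\eta(A_1 \cap A_2) = \int E_\eta(1_{A_1}|\mathcal{B}_0)\, E_\eta(1_{A_2}|\mathcal{B}_0)\,d\eta$.

We will use the fact that if $f$ is $\mathcal{B}_1$-measurable, then

(32)  $E_\eta(f|\mathcal{B}_2) = E_\eta(f|\mathcal{B}_0)$,

a consequence of Lemma 1.

Standard results of entropy theory tell us that $h_{\mu_i}(X) = H_\eta(\mathcal{P}_i|\mathcal{B}_i^-)$.
Further, by Pinsker's Formula (Theorem 6.3, p. 67 of the cited reference, applied
with $\beta$ coarser than $\alpha$), this can be re-expressed as

(33)  $h_{\mu_i}(X) = H_\eta(\mathcal{P}_i|\mathcal{B}_i^- \vee \mathcal{B}_0) + H_\eta(\mathcal{Q}|\mathcal{B}_0^-) = H_\eta(\mathcal{P}_i|\mathcal{B}_i^- \vee \mathcal{B}_0) + h_\nu(Y)$.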

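Since $\mu_1$ and $\mu_2$ were presumed to be measures of maximal entropy
in the fiber, they have equal entropy and hence $H_\eta(\mathcal{P}_1|\mathcal{B}_1^- \vee \mathcal{B}_0) =
H_\eta(\mathcal{P}_2|\mathcal{B}_2^- \vee \mathcal{B}_0)$. Our aim is to show that this leads to a contradiction
by showing that $H_\eta(\mathcal{P}_3|\mathcal{B}_3^- \vee \mathcal{B}_0) > H_\eta(\mathcal{P}_1|\mathcal{B}_1^- \vee \mathcal{B}_0)$. By definition,

(34)  $H_\eta(\mathcal{P}_i|\mathcal{B}_i^- \vee \mathcal{B}_0) = \int -\sum_j 1_{[j]}\circ\pi_i\, \log E(1_{[j]}\circ\pi_i|\mathcal{B}_i^- \vee \mathcal{B}_0)\,d\eta$
      $= \int -\sum_j E(1_{[j]}\circ\pi_i|\mathcal{B}_i^- \vee \mathcal{B}_0)\, \log E(1_{[j]}\circ\pi_i|\mathcal{B}_i^- \vee \mathcal{B}_0)\,d\eta$
      $= \int \sum_j \psi(E(1_{[j]}\circ\pi_i|\mathcal{B}_i^- \vee \mathcal{B}_0))\,d\eta$,

where $\psi$ is the strictly concave function $[0, 1] \to [0, 1]$, $\psi(x) = -x\log x$
(with $\psi(0)$ defined to be 0).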

The following claim is an essential point of the argument. We shall
show that

(35)  $E_\eta(1_{[j]}\circ\pi_3|\mathcal{B}_1^- \vee \mathcal{B}_2^- \vee \mathcal{B}_3^- \vee \mathcal{B}_0)(z) =$
      $E_\eta(1_{[j]}\circ\pi_1|\mathcal{B}_1^- \vee \mathcal{B}_0)$ if $\pi_3(z)_{-1} = \pi_1(z)_{-1} \ne \pi_2(z)_{-1}$;
      $E_\eta(1_{[j]}\circ\pi_2|\mathcal{B}_2^- \vee \mathcal{B}_0)$ if $\pi_3(z)_{-1} = \pi_2(z)_{-1} \ne \pi_1(z)_{-1}$;
      $\frac12 E_\eta(1_{[j]}\circ\pi_1|\mathcal{B}_1^- \vee \mathcal{B}_0) + \frac12 E_\eta(1_{[j]}\circ\pi_2|\mathcal{B}_2^- \vee \mathcal{B}_0)$ if $\pi_3(z)_{-1} = \pi_1(z)_{-1} = \pi_2(z)_{-1}$.

Clearly, the right-hand side of the equation is measurable with respect
to $\mathcal{B}_1^- \vee \mathcal{B}_2^- \vee \mathcal{B}_3^- \vee \mathcal{B}_0$. To verify the claim, it will be sufficient to
integrate the right-hand side over the elements of a generating
semi-algebra of $\mathcal{B}_1^- \vee \mathcal{B}_2^- \vee \mathcal{B}_3^- \vee \mathcal{B}_0$. Specifically, we will integrate over sets
of the form $A \cap B \cap C \cap D$, where $A$, $B$ and $C$ are the preimages under
the respective maps of cylinder sets in $X$ of a common length (ending
at time $-1$) and $D \in \mathcal{B}_0$.

Suppose $A$, $B$, and $C$ are cylinders depending on the coordinates
$-n$ to $-1$ of $\pi_1(z)$, $\pi_2(z)$, and $\pi_3(z)$ and that $A \cap B \cap C$ has positive
measure. Then for $z \in A \cap B \cap C$, $\pi_3(z)_{-1}$ is equal to either $\pi_1(z)_{-1}$
or $\pi_2(z)_{-1}$ (or both) by definition of $\pi_3$. Further, $\pi_1(z)_{-1}$, $\pi_2(z)_{-1}$, and
$\pi_3(z)_{-1}$ are constant over the intersection in question.

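If on $A \cap B \cap C$, $\pi_3(z)_{-1} = \pi_1(z)_{-1} \ne \pi_2(z)_{-1}$, then we calculate

(36)  $\int_{A\cap B\cap C\cap D} E_\eta(1_{[j]}\circ\pi_1|\mathcal{B}_1^- \vee \mathcal{B}_0)(z)\,d\eta(z) = \int_{X^2}\int_R 1_A 1_B 1_C 1_D\, E_\eta(1_{[j]}\circ\pi_1|\mathcal{B}_1^- \vee \mathcal{B}_0)\,d\beta(r)\,d\hat\mu(u,v)$.

Performing first the integration over $R$ with respect to the measure $\beta$,
we see that the only factor depending on the random part $r \in R$ is
$1_C$, the others being functions only of $(u, v) \in X^2$. The coordinates of
$\pi_3(z)$ from $-n$ to $-1$ are concatenations of blocks of $\pi_1(z)$ and $\pi_2(z)$,
the choice (between a block in $u$ and a different block in $v$) being made
according to the entries in $r$, hence with probabilities 1/2, 1/2. If
$k = k_{A,B}(u, v) = 1 + \mathrm{card}\{j : -n \le j \le -2,\ u_j = v_j,\ u_{j+1} \ne v_{j+1}\}$, then

(37)  $\int_R 1_C(u, v, r)\,d\beta(r) = 2^{-k}$,

which is constant on $A \cap B$. The following calculation will be more
readable if we write $E^{\mathcal{B}} f$ for $E(f|\mathcal{B})$. Since $B \in \mathcal{B}_2$ and $\mathcal{B}_2 \perp_{\mathcal{B}_0} \mathcal{B}_1^-$, we
have $E^{\mathcal{B}_1^- \vee \mathcal{B}_0} 1_B = E^{\mathcal{B}_0} 1_B$. Consequently,

(38)  $\int_{A\cap B\cap C\cap D} E_\eta(1_{[j]}\circ\pi_1|\mathcal{B}_1^- \vee \mathcal{B}_0)(z)\,d\eta$
      $= \int_{X^2} 2^{-k}\, 1_D 1_B 1_A\, E^{\mathcal{B}_1^- \vee \mathcal{B}_0}(1_{[j]}\circ\pi_1)\,d\hat\mu(u,v)$
      $= \int_{X^2} 2^{-k}\, E^{\mathcal{B}_1^- \vee \mathcal{B}_0}[1_B\, E^{\mathcal{B}_1^- \vee \mathcal{B}_0}(1_D 1_A \cdot (1_{[j]}\circ\pi_1))]\,d\hat\mu(u,v)$
      $= \int_{X^2} 2^{-k}\, [E^{\mathcal{B}_0} 1_B]\,[E^{\mathcal{B}_1^- \vee \mathcal{B}_0}(1_D 1_A \cdot (1_{[j]}\circ\pi_1))]\,d\hat\mu(u,v)$
      $= \int_{X^2} 2^{-k}\, [E^{\mathcal{B}_0} 1_B]\,[E^{\mathcal{B}_0}(1_D 1_A \cdot (1_{[j]}\circ\pi_1))]\,d\hat\mu(u,v)$
      $= \int_{X^2} 2^{-k}\, 1_B 1_D 1_A \cdot (1_{[j]}\circ\pi_1)\,d\hat\mu(u,v)$
      $= \eta(A \cap B \cap C \cap D \cap \pi_3^{-1}[j])$

by (31), since $B, D \in \mathcal{B}_2$ and $A, \pi_1^{-1}[j] \in \mathcal{B}_1$. This demonstrates the
desired equality in the case $\pi_3(z)_{-1} = \pi_1(z)_{-1} \ne \pi_2(z)_{-1}$. The case
$\pi_3(z)_{-1} = \pi_2(z)_{-1} \ne \pi_1(z)_{-1}$ is dealt with similarly.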

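If $\pi_3(z)_{-1} = \pi_1(z)_{-1} = \pi_2(z)_{-1}$, then the integrand is the average of
the two previous integrands, so we see that

(39)  $\int_{A\cap B\cap C\cap D} (\frac12 E_\eta(1_{[j]}\circ\pi_1|\mathcal{B}_1^- \vee \mathcal{B}_0) + \frac12 E_\eta(1_{[j]}\circ\pi_2|\mathcal{B}_2^- \vee \mathcal{B}_0))\,d\eta$
      $= \frac12 \eta(A \cap B \cap C \cap D \cap \pi_1^{-1}[j]) + \frac12 \eta(A \cap B \cap C \cap D \cap \pi_2^{-1}[j])$
      $= \eta(A \cap B \cap C \cap D \cap \pi_3^{-1}[j])$.

This completes the proof of equation (35).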
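Using (35), we have

(40)  $H_\eta(\mathcal{P}_3|\mathcal{B}_3^- \vee \mathcal{B}_0) \ge H_\eta(\mathcal{P}_3|\mathcal{B}_1^- \vee \mathcal{B}_2^- \vee \mathcal{B}_3^- \vee \mathcal{B}_0)$
      $= \int \sum_j \psi(E_\eta(1_{[j]}\circ\pi_3|\mathcal{B}_1^- \vee \mathcal{B}_2^- \vee \mathcal{B}_3^- \vee \mathcal{B}_0))\,d\eta$.

We separate the integral into parts according to whether $\pi_3(z)_{-1}$ is
equal to $\pi_1(z)_{-1}$, $\pi_2(z)_{-1}$ or both. Let $S_1 = \{z : \pi_3(z)_{-1} = \pi_1(z)_{-1} \ne
\pi_2(z)_{-1}\}$, $S_2 = \{z : \pi_3(z)_{-1} = \pi_2(z)_{-1} \ne \pi_1(z)_{-1}\}$ and $S_3 = \{z : \pi_3(z)_{-1} =
\pi_1(z)_{-1} = \pi_2(z)_{-1}\}$. Let $A = \{z : \pi_1(z)_{-1} \ne \pi_2(z)_{-1}\}$ so that $A =
S_1 \cup S_2$. Note that $S_1$ and $S_2$ have equal measure by definition of $\pi_3$.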

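By symmetry,

(41)  $\int_{S_1} \sum_j \psi(E_\eta(1_{[j]}\circ\pi_1|\mathcal{B}_1^- \vee \mathcal{B}_0))\,d\eta = \int_{S_2} \sum_j \psi(E_\eta(1_{[j]}\circ\pi_1|\mathcal{B}_1^- \vee \mathcal{B}_0))\,d\eta$,

so by (35),

(42)  $\int_{S_1} \sum_j \psi(E_\eta(1_{[j]}\circ\pi_3|\mathcal{B}_1^- \vee \mathcal{B}_2^- \vee \mathcal{B}_3^- \vee \mathcal{B}_0))\,d\eta = \frac12 \int_A \sum_j \psi(E_\eta(1_{[j]}\circ\pi_1|\mathcal{B}_1^- \vee \mathcal{B}_0))\,d\eta$.

Similarly,

(43)  $\int_{S_2} \sum_j \psi(E_\eta(1_{[j]}\circ\pi_3|\mathcal{B}_1^- \vee \mathcal{B}_2^- \vee \mathcal{B}_3^- \vee \mathcal{B}_0))\,d\eta = \frac12 \int_A \sum_j \psi(E_\eta(1_{[j]}\circ\pi_2|\mathcal{B}_2^- \vee \mathcal{B}_0))\,d\eta$.

Finally, integrating over $S_3$,

(44)  $\int_{S_3} \sum_j \psi(E_\eta(1_{[j]}\circ\pi_3|\mathcal{B}_1^- \vee \mathcal{B}_2^- \vee \mathcal{B}_3^- \vee \mathcal{B}_0))\,d\eta$
      $= \int_{A^c} \sum_j \psi(\frac12 (E_\eta(1_{[j]}\circ\pi_1|\mathcal{B}_1^- \vee \mathcal{B}_0) + E_\eta(1_{[j]}\circ\pi_2|\mathcal{B}_2^- \vee \mathcal{B}_0)))\,d\eta$
      $> \frac12 \int_{A^c} \sum_j (\psi(E_\eta(1_{[j]}\circ\pi_1|\mathcal{B}_1^- \vee \mathcal{B}_0)) + \psi(E_\eta(1_{[j]}\circ\pi_2|\mathcal{B}_2^- \vee \mathcal{B}_0)))\,d\eta$.

The strict inequality in the above arises since $\psi$ is strictly concave
and there exist a $j$ in the alphabet of $X$ and a set of points of positive
measure in $A^c = \{(u, v, r) \in Z = X^2 \times R : u_{-1} = v_{-1}\}$ for which
$E_\eta(1_{[j]}\circ\pi_1|\mathcal{B}_1^- \vee \mathcal{B}_0) \ne E_\eta(1_{[j]}\circ\pi_2|\mathcal{B}_2^- \vee \mathcal{B}_0)$; for, if not, Lemma 3
would imply that $\mu_1 = \mu_2$.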
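
The role of strict concavity in (44) can be checked numerically on a single pair of values. The sketch below is only an illustration (the names `psi` and `concavity_gap` are hypothetical): it compares $\psi$ at the average of two conditional probabilities $s, t$ with the average of $\psi(s)$ and $\psi(t)$, where $\psi(x) = -x\log x$.

```python
import math

def psi(x):
    """psi(x) = -x log x on [0, 1], with psi(0) = 0."""
    return 0.0 if x == 0 else -x * math.log(x)

def concavity_gap(s, t):
    """psi of the average minus the average of psi; positive when s != t."""
    return psi((s + t) / 2) - (psi(s) + psi(t)) / 2

print(concavity_gap(0.2, 0.7))   # strictly positive: averaging gains entropy
print(concavity_gap(0.5, 0.5))   # zero: no gain when the two values agree
```

The gain is strictly positive exactly when the two values differ, which is why the inequality in (44) is strict as soon as the two conditional expectations disagree on a set of positive measure.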
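Now adding the preceding relations, we see

(45)  $H_\eta(\mathcal{P}_3|\mathcal{B}_3^- \vee \mathcal{B}_0) > \frac12 (\int \sum_j \psi(E_\eta(1_{[j]}\circ\pi_1|\mathcal{B}_1^- \vee \mathcal{B}_0))\,d\eta + \int \sum_j \psi(E_\eta(1_{[j]}\circ\pi_2|\mathcal{B}_2^- \vee \mathcal{B}_0))\,d\eta)$
      $= \frac12 (H(\mathcal{P}_1|\mathcal{B}_1^- \vee \mathcal{B}_0) + H(\mathcal{P}_2|\mathcal{B}_2^- \vee \mathcal{B}_0))$
      $= h_{\mu_1}(X) - h_\nu(Y)$.

From (33), we see that $h_{\mu_3}(X) > h_{\mu_1}(X)$ as required. □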

Remark. It would be desirable to have a proof of this result based on 
the Shannon-McMillan-Breiman Theorem, but so far we have not been 
able to construct one. 

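Definition. Let $(X, \mathcal{B}, \mu, T)$ and $(Y, \mathcal{C}, \nu, S)$ be measure-preserving
systems, $\pi : X \to Y$ a factor map, and $\alpha$ a finite generating partition for
$X$. We say that $\mu$ is relatively Markov for $\alpha$ over $Y$ if it satisfies one
of the following two equivalent conditions:

(1) $\alpha \perp_{T^{-1}\alpha \vee \pi^{-1}\mathcal{C}} \alpha_1^\infty$;

(2) $H_\mu(\alpha|\alpha_1^\infty \vee \pi^{-1}\mathcal{C}) = H_\mu(\alpha|T^{-1}\alpha \vee \pi^{-1}\mathcal{C})$.
(As usual, $\alpha_1^\infty = \bigvee_{i=1}^\infty T^{-i}\alpha$.)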

Corollary 3. If $X$ is a 1-step SFT, $Y$ is a subshift, $\pi : X \to Y$ is a
1-block factor map, $\nu$ is an ergodic measure on $Y$, and $\mu$ is an ergodic
relatively maximal measure over $\nu$, then $\mu$ is relatively Markov for the
time-0 partition of $X$ over $Y$.

Proof. We apply the first half of the proof of Lemma 3 with $\mu_1 = \mu_2 =
\mu$. Note that then $\hat\mu(S) > 0$. If $s_{-1} g_i^{(1)} = s_{-1} g_i^{(2)}$ for all symbols $i$
in the alphabet of $X$, the proof proceeds as before to show that the
information function with respect to $\mu$ of the time-0 partition $\mathcal{P}$ of $X$
given $\mathcal{B}_X^- \vee \pi^{-1}\mathcal{B}_Y$ is measurable with respect to $\mathcal{P} \vee \sigma^{-1}\mathcal{P} \vee \pi^{-1}\mathcal{B}_Y$,
and hence $\mu$ is a 1-step relatively Markov measure.

If there is a symbol $i$ in the alphabet of $X$ for which $s_{-1} g_i^{(1)} \ne s_{-1} g_i^{(2)}$,
then the construction in the proof of Theorem 1, by interleaving strings
according to another random process, will again produce a measure
projecting to $\nu$ which will have entropy greater than $h(\mu)$. □
3. Examples



Example 1. In case $\pi$ has a singleton clump $a$ and $\nu$ is Markov on $Y$, we
can construct the unique relatively maximal measure above $\nu$ explicitly.
Denote the cylinder sets $[a]$ in $X$ and in $Y$ by $X_a$ and $Y_a$, respectively.
If $\nu$ is (1-step) Markov on $Y$, then the first-return map $\sigma_a : Y_a \to Y_a$ is
countable-state Bernoulli with respect to the restricted and normalized
measure $\nu_a = \nu/\nu[a]$: the states are all the loops or return blocks $aC^i$
with $aC^i a = a c^i_1 \dots c^i_{r_i} a$ appearing in $Y$ and no $c^i_j = a$.

Under $\pi^{-1}$, the return blocks to $[a]$ expand into bands $aB^{i,j}$, with
$aB^{i,j}a$ appearing in $X$ and $\pi B^{i,j} = C^i$ for all $i, j$. Topologically,
$(X_a, \sigma_a)$ is a countable-state full shift on these symbols $aB^{i,j}$. We
define $\mu_a$ to be the countable-state Bernoulli measure on $(X_a, \sigma_a)$ which
equidistributes the measure of each loop (state) of $Y_a$ over its preimage
band:

(46)  $\mu_a[aB^{i,j}] = \frac{\nu_a[aC^i a]}{|\pi^{-1}[aC^i a]|}$ for all $i, j$.

We show now that this choice of $\mu_a$ is relatively maximal over $\nu_a$. Let
$\lambda_a$ be any probability measure on $X_a$ which maps under $\pi$ to $\nu_a$. Then
the countable-state Bernoulli measure on $X_a$ which agrees with $\lambda_a$ on
all the 1-blocks $aB^{i,j}$ (its "Bernoullization") has entropy no less than
that of $\lambda_a$ and still projects to the Bernoulli measure $\nu_a$, so we may as
well assume that $\lambda_a$ is countable-state Bernoulli. If $\lambda_a[aB^{i,j}] = q^{i,j}$ and
$|\pi^{-1}[aC^i a]| = J_i$ for all $i$, then

(47)  $h(X_a, \sigma_a, \lambda_a) = -\sum_{i=1}^\infty \sum_{j=1}^{J_i} q^{i,j} \log q^{i,j}$.

Note that for each $i$

(48)  $\sum_{j=1}^{J_i} q^{i,j} = \nu_a[aC^i a]$

is fixed at the same value for all $\lambda_a$. Thus for each $i$,

(49)  $-\sum_{j=1}^{J_i} q^{i,j} \log q^{i,j}$

is maximized by putting all the $q^{i,j}$ equal to one another.


Finally, this unique relatively maximal $\mu_a$ over $\nu_a$ determines the
unique relatively maximal $\mu$ on $X$ over $\nu$ on $Y$, since according to
Abramov's formula

(50)  $h(X, \sigma, \mu) = \mu[a]\, h(X_a, \sigma_a, \mu_a)$,

and $\mu[a] = \nu[a]$.
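
The maximization step behind (48) and (49) is the elementary fact that, for a fixed total mass spread over $J$ atoms, the sum $-\sum_j q_j \log q_j$ is largest when all $q_j$ are equal. A minimal numerical sketch follows; the function names `band_entropy` and `equidistribute` and the particular masses are illustrative assumptions, not values from the paper.

```python
import math

def band_entropy(qs):
    """Contribution -sum q log q of one band (the atoms over a single loop)."""
    return -sum(q * math.log(q) for q in qs if q > 0)

def equidistribute(mass, band_size):
    """Split the measure of a loop evenly over its preimage band, as in (46)."""
    return [mass / band_size] * band_size

loop_mass = 0.6                       # hypothetical value of nu_a[a C^i a]
uneven = [0.1, 0.2, 0.3]              # another splitting with the same total
even = equidistribute(loop_mass, 3)

print(band_entropy(uneven), band_entropy(even))
```

Any uneven splitting of the same total mass gives a strictly smaller contribution, so applying this band by band yields the measure $\mu_a$.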

We show how this calculation of the unique relatively maximal
measure over a Markov measure in the case of a singleton clump works
out in a particular case. It was shown in [19] that for the following
factor map there is a saturated compensation function $G\circ\pi$ with
$G \in C(Y)$ but no such compensation function with $G \in \mathcal{F}(Y)$. There
is a singleton clump, $a$.




For each $k \ge 1$ the block $ab^k a$ in $Y$ has $k + 1$ preimages, depending
on when the subscript on $b$ switches from 1 to 2. Let $\nu$ be Markov on
$Y$. To each preimage $aB_1 aB_2 a \dots aB_r$ of $ab^{k_1} ab^{k_2} \dots ab^{k_r}$ the optimal
measure $\mu_a$ assigns measure

(51)  $\mu_a[aB_1 aB_2 a \dots aB_r] = \frac{1}{k_1+1} \cdots \frac{1}{k_r+1}\, \nu_a[ab^{k_1} ab^{k_2} \dots ab^{k_r}]$.

The unique relatively maximal measure over $\nu_a$ can be described in
terms of fiber measures as follows. Given $y = ab^{k_1} ab^{k_2} \dots ab^{k_r} \dots \in Y_a$,
$\mu_{a,y}$ chooses the preimages of each $b^{k_i}$ with equal probabilities and
independently of the choice of preimage of any other $b^{k_j}$. Then

(52)  $\mu_a[aB_1 aB_2 a \dots aB_r] = \int_{Y_a} \mu_{a,y}[aB_1 aB_2 a \dots aB_r]\,d\nu_a(y)$.

Example 2. The relatively maximal measures over an ergodic measure
$\nu$ on $Y$ which is supported on the orbit $O(y)$ of a periodic point
$y = CCC \dots \in Y$ can be found by analyzing the SFT $X_y = \pi^{-1}O(y)$.
The relatively maximal measures over $\nu$ are determined by the maximal
(Shannon-Parry) measures on the irreducible components of $X_y$.
Consequently, if $X_y$ is irreducible, then the discrete invariant measure
on the orbit of $y$ is $\pi$-determinate.
Example 3. Failure of $\pi$-determinacy for a fully-supported measure. In
the preceding example, along with others discussed in the references, failure of
$\pi$-determinacy can be blamed on lack of communication among fibers.
An example suggested by Walters also shows that there can
be fully supported $\nu$ on $Y$ which are not $\pi$-determinate. For such
examples there are potential functions $V \in C(Y)$ such that $V\circ\pi$ has two
equilibrium states which project to the same ergodic measure on $Y$.

In this example, $X = Y = \Sigma_2$ = full 2-shift, and $\pi(x)_0 = x_0 + x_1$
mod 2 is a simple cellular automaton 2-block map. If we replace $X$ by
its 2-block recoding, so that $\pi$ becomes a 1-block map, we obtain the
following diagram:




This is a finite-to-one map and hence is Markovian; for example,
the Bernoulli 1/2, 1/2 measure on $\Sigma_2$ is mapped to itself. The constant
function 0 is a compensation function. Thus every Markov measure on
$Y$ is $\pi$-determinate: the equilibrium state $\mu_V$ of a locally constant $V$ on
$Y$ lifts to the equilibrium state of $V\circ\pi$, which is the unique relatively
maximal measure over $\mu_V$ (in fact it's the only measure in $\pi^{-1}\{\mu_V\}$).

For every ergodic $\nu$ on $Y$, all of $\pi^{-1}\{\nu\}$ consists of relatively maximal
measures over $\nu$, all of them having the same entropy as $\nu$.

If $p \ne 1/2$, the two measures on the SFT $X$ that correspond to
the Bernoulli measures $B(p, 1-p)$ and $B(1-p, p)$ both map to the
same measure $\nu_p$ on $Y$. Thus $\nu_p$, which is fully supported on $Y$, is not
$\pi$-determinate. (An entropy-decreasing example is easily produced by
forming the Cartesian product of $X$ with another SFT.)
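
That $B(p, 1-p)$ and $B(1-p, p)$ have the same image under $\pi(x)_0 = x_0 + x_1$ mod 2 can be checked on cylinder sets. The sketch below (the function name `image_block_prob` is hypothetical, and symbol 0 is assumed to carry probability $p$) uses the fact that each block $y_0 \dots y_{n-1}$ has exactly two preimage blocks $x_0 \dots x_n$, determined by the choice of $x_0$.

```python
from itertools import product

def image_block_prob(y_block, p):
    """Measure of the cylinder [y_0...y_{n-1}] under the image of B(p, 1-p)."""
    total = 0.0
    for x0 in (0, 1):                 # the two preimage blocks differ by complement
        x = [x0]
        for y in y_block:
            x.append(x[-1] ^ y)       # x_{i+1} = x_i + y_i mod 2
        prob = 1.0
        for symbol in x:
            prob *= p if symbol == 0 else 1 - p
        total += prob
    return total

p = 0.3
for y_block in product((0, 1), repeat=3):
    print(y_block, image_block_prob(y_block, p), image_block_prob(y_block, 1 - p))
```

Exchanging $p$ and $1-p$ swaps the two preimage blocks of each $y$-block, so the two columns printed agree for every block.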

Moreover, $\nu_p$ is the unique equilibrium state of some continuous
function $V_p$ on $Y$ [15]. Then the set of relatively maximal measures over
$\nu_p$, which is the entire set $\pi^{-1}\{\nu_p\}$, consists of the equilibrium states of
$V_p\circ\pi + G\circ\pi = V_p\circ\pi$, so this potential function $V_p\circ\pi$ has many
equilibrium states.
Example 4. Homogeneous clumps. In the following example there is no
singleton clump, but the clumps are homogeneous with respect to $\pi$
so there is a locally constant compensation function,
and hence every Markov measure on $Y$ is $\pi$-determinate and its unique
relatively maximal lift is Markov.

[Diagram of the factor map; only the vertex labels $a_1$, $a_2$ are recoverable.]

In this case the return time to $[a]$ is bounded, so $X_a$ is a finite-state
SFT rather than the countable-state chain of the general case. There
are six states, ..., and $a_2 a_1$, according to
the time-0 entries of $x \in X_a$ and $\sigma_a x$. Fix this order of the states for
indexing purposes. It can be shown by direct calculation that for this
example a stochastic matrix $P$ determines a Markov measure on $X_a$
that is relatively maximal over its image if and only if it is of the form


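(53)  $P = \begin{pmatrix}
      x & 1-2x & x & 0 & 0 & 0 \\
      y & 1-2y & y & 0 & 0 & 0 \\
      0 & 0 & 0 & x & 1-2x & x \\
      0 & 0 & 0 & x & 1-2x & x \\
      0 & 0 & 0 & y & 1-2y & y \\
      x & 1-2x & x & 0 & 0 & 0
      \end{pmatrix}$
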

(In this case the image measure is also Markov.)

Here $0 < x, y < 1/2$ and the probability vector fixed by $P$ is

(54)  $p = \frac{1}{4y + 2(1-2x)}\,(y, 1-2x, y, y, 1-2x, y)$.

Further, given a (1-step) Markov measure $\nu$ on $Y$, put $K = \nu[aa]/\nu[aba]$.
Then a stochastic matrix of the form (53) with fixed vector $p$ satisfies
$p_1 + p_3 + p_4 + p_6 = \nu[aa]$ and $p_2 + p_5 = \nu[aba]$ (so that the Markov measure
$\mu$ that it determines projects to $\nu$) if and only if $x = y = K/(2K + 2)$
(and then $\mu$ is relatively maximal over $\nu$).
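
The relation between the matrix (53) and the vector (54) can be verified in exact arithmetic. The sketch below is an illustration under an assumption: it reads (53) as the $6 \times 6$ matrix whose rows 1 and 6 are $(x, 1-2x, x, 0, 0, 0)$, row 2 is $(y, 1-2y, y, 0, 0, 0)$, rows 3 and 4 are $(0, 0, 0, x, 1-2x, x)$ and row 5 is $(0, 0, 0, y, 1-2y, y)$; the function names are hypothetical.

```python
from fractions import Fraction

def build_P(x, y):
    """Stochastic matrix of the assumed form (53)."""
    rx, ry = [x, 1 - 2 * x, x], [y, 1 - 2 * y, y]
    zeros = [0, 0, 0]
    return [rx + zeros, ry + zeros, zeros + rx, zeros + rx, zeros + ry, rx + zeros]

def fixed_vector(x, y):
    """Probability vector (54)."""
    z = 4 * y + 2 * (1 - 2 * x)
    return [y / z, (1 - 2 * x) / z, y / z, y / z, (1 - 2 * x) / z, y / z]

def times(p, P):
    """Row vector times matrix."""
    return [sum(p[i] * P[i][j] for i in range(6)) for j in range(6)]

x, y = Fraction(1, 5), Fraction(1, 3)
p = fixed_vector(x, y)
print(times(p, build_P(x, y)) == p)          # p is fixed by P

K = Fraction(3, 2)                           # hypothetical value of nu[aa]/nu[aba]
x = K / (2 * K + 2)
p = fixed_vector(x, x)
print((p[0] + p[2] + p[3] + p[5]) / (p[1] + p[4]) == K)
```

With $x = y = K/(2K+2)$ the ratio $(p_1 + p_3 + p_4 + p_6)/(p_2 + p_5)$ equals $K$, in agreement with the condition stated above.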

Example 5. Singleton clump after recoding. Make the preceding
example a little bit more complicated by adding a loop at $b_1$, so that now
the return time to $[a]$ is unbounded. It can be verified that now there
is still a continuous saturated compensation function, but there is no
locally constant compensation function, so the code is not Markovian.
However, if we look at higher block presentations of $X$ and $Y$, we can
find singleton clumps, for example $abba$. Therefore again every Markov
measure on $Y$ is $\pi$-determinate.

[Diagram of the factor map; only the vertex labels $a_1$, $a_2$ are recoverable.]

Example 6. No singleton clumps. Complicating the preceding example a bit
more, we can produce a situation in which there are no singleton clumps, not
even for any higher block presentation.

For this example it can be shown that there is a continuous saturated
compensation function $G\circ\pi$, but we do not know exactly which
measures are $\pi$-determinate. Although the example appears simple, the
question of how many fibers allow how much switching is complex.